Picture for Di He

Di He

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

Add code
Feb 02, 2026
Viaarxiv icon

Towards Solving the Gilbert-Pollak Conjecture via Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

The AI Hippocampus: How Far are We From Human Memory?

Add code
Jan 14, 2026
Viaarxiv icon

Luminark: Training-free, Probabilistically-Certified Watermarking for General Vision Generative Models

Add code
Jan 03, 2026
Viaarxiv icon

Efficient Reasoning for Large Reasoning Language Models via Certainty-Guided Reflection Suppression

Add code
Aug 07, 2025
Viaarxiv icon

Solving the Hubbard model with Neural Quantum States

Add code
Jul 03, 2025
Viaarxiv icon

AlphaDecay:Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs

Add code
Jun 17, 2025
Viaarxiv icon

Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value

Add code
Jun 16, 2025
Viaarxiv icon

Theoretical Benefit and Limitation of Diffusion Language Model

Add code
Feb 13, 2025
Figure 1 for Theoretical Benefit and Limitation of Diffusion Language Model
Figure 2 for Theoretical Benefit and Limitation of Diffusion Language Model
Figure 3 for Theoretical Benefit and Limitation of Diffusion Language Model
Figure 4 for Theoretical Benefit and Limitation of Diffusion Language Model
Viaarxiv icon

Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning

Add code
Feb 12, 2025
Figure 1 for Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Figure 2 for Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Figure 3 for Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Figure 4 for Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Viaarxiv icon